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Peptides constructed with the 20 natural amino acids are generally considered to have little therapeutic 
potential because they are unstable in the presence of proteases and peptidases. However, proteolysis 
cleavage can be idiosyncratic, and it is possible that natural analogues of functional sequences exist that are 
highly resistant to cleavage. Here, we explored this idea in the context of peptides that bind to the signaling 
protein Gail. To do this, we used a two-step in vitro selection process to simultaneously select for protease 
resistance while retaining function-first by degrading the starting library with protease (chymotrypsin), 
followed by positive selection for binding via mRNA display. Starting from a pool of functional sequences, 
these experiments revealed peptides with 100-400 fold increases in protease resistance compared to the 
parental library. Surprisingly, selection for chymotrypsin resistance also resulted in similarly improved 
stability in human serum (—100 fold). Mechanistically, the decreases in cleavage results from both a lower 
rate of cleavage (kcat) and a weaker interaction with the protease (Km). Overall, our results demonstrate that 
the hydrolytic stability of functional, natural peptide sequences can be improved by two orders of magnitude 
simply by optimizing the primary sequence. 

The major limitation for using peptides as affinity reagents, probes, and therapeutics is their inherent 
instability in biological environments. Although the amide backbone is chemically stable, peptides are 
readily broken down in a matter of seconds in the digestive tract, in blood, plasma, serum, and inside cells 
due to the presence of proteases\ Because of this instability, many routes have been devised to chemically alter or 
modify natural peptides such as 1) including the addition of N-methylation to the backbone^, 2) insertion of P- 
amino acids^'^, 3) changing the location of the side chain (e.g., peptoids^'^), or 4) by covalent cyclization via 
insertion of chemical bridges^'^. 

Left unanswered by these studies is the question of how much a functional peptide with natural amino acids 
could be stabilized by sequence optimization alone. Previously, we demonstrated a route to create high-diversity 
cyclic peptide libraries via mRNA display^'^° and used this approach to isolate a high affinity binder termed 
cycGiBP to the signaling protein Gail-GDP^\ cycGiBP showed a very high affinity (K^ = 2.1 nM) and ~3-fold 
increase in protease resistance as compared with the corresponding linear sequence. One observation in that work 
was that only one of the three possible cleavage sites predominated when cycGiBP was subjected to chymotrypsin. 
Using the standard substrate notation for proteases (P3-P2-P1 -Pi ' -P2 ' -P3 ' ; P's represent the amino acid identity at 
a position and the scissile bond is located between Pi and P/ residue), chymotrypsin has a strong Pi preference 
for W > Y > F V^. However, cycGiBP shows cleavage at Pi = Y5, but not at Pi = W4 or ¥y^^. 

We wondered if there were other members in that library that had improved protease resistance while retaining 
binding function. To address this issue, we performed a dual selection for chymotrypsin resistance and binding 
function on a library previously only sieved for binding function. Those experiments resulted in a several highly 
protease resistant peptides and indicate that sequence optimization can improve hydrolytic stability by as much as 
400 -fold compared to peptides sequences isolated without this selective pressure. 

Methods 

E. coli expression of Gail -GDP and mRNA display. Gail -GDP with a C-terminal BirA tag was expressed and purified as previously 
described^^ mRNA display selection targeting Gail-GDP was performed starting at round 7 of our previous work and performed as 
described^ ^ with the modification that the cDNA fusions were digested with 2 mg of immobilized chymotrypsin ( Sigma- Aldrich) per 
10*^ cpm of fusions at room temperature for 15 minutes in 50 mM Sodium Phosphate buffer (pH = 8.0). The chymotrypsin beads were 
removed by centrifugation through 0.45 |j,m filters before the selection step. 
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Peptide synthesis. R6A (MSQTKRLDDQLYWWEYL), Biotin- labeled R6A (Bio- 
MSQTKRLDDQLYWWEYL), cycPRP-1 (MITWIDFISPSK), cycPRP-2 
(MTWFEYLSGSK), cycPRP-3 (MTWFEFLSSTSK), and cycGiBP 
(MITWYEFVAGTK) were synthesized on Rink Amide AM Resin LL (Novabiochem) 
and cyclized using DSG as described by Millward et aP\ After the reaction the 
cyclized peptides were purified via Cig HPLC and the mass confirmed by MALDI- 
TOF MS. 

Binding constant determination. Binding constants were determined relative to the 
R6A peptide [Kd = 60 nM]^'* by equilibrium competition using ^^[S]-Met 
radiolabeled Gail -GDP and biotinylated R6A as previously described^" and the data 
analyzed using GraphPad Prism 5.0. 

Protease resistance. Peptides (250 nmol of peptide in DMSO) were added to 50 mM 
sodium phosphate buffer (pH 8.0) with a final DMSO concentration of 2% (v/v). Sixty 
units of immobilized chymotrypsin (Sigma Aldrich) were added and allowed to 
incubate at room temperature. Aliquots were taken at various time points and 
subsequently filtered. The aliquots were then injected onto a Ci8 reverse phase 
column and separated by a gradient elution from 15 to 90% B in 25 minutes. Solvent A 
consisted of 0.1% (v/v) TEA in water and solvent B contained CH3CN with 0.1% (v/v) 
TFA. The area under the starting material peak was quantitated using the 32 
KaratGold Software package (Beckman). The graph was generated by fitting the data 
to a one phase exponential decay equation (GraphPad Prism 5.0). The mean and the 
standard error are reported in table 1. 

Km and Vmax determination. The peptides were prepared and characterized as 
described above in the protease resistance experiment. Only a single 2 minute time 
point with varying concentrations of the peptides (0, 5, 1 1, 22, 65, and 260 |iM) was 
analyzed using Michaelis-Menten enzyme kinetics regression equation (GraphPad 
Prism 5.0). The mean and the standard error are reported in table 3. 

Human serum digests. Lyophilized human serum (Thermo Scientific) was 
reconstituted by adding 2 mL ddH20 to each 5 mg vial from the manufacturer. Each 
reaction contained 250 nmol of peptide, 50 \iL of sodium phosphate buffer (pH 8.0), 
and 10% DMSO (v/v). One milliliter of reconstituted serum was added to each sample 
and incubated at 37°C. For each time point, 100 \iL aliquots were taken from the 
reaction and quenched in 300 \iL of acetonitrile. These quenched samples were 
centrifuged to separate precipitate, and the supernatant was diluted in water to 
1.5 mL. Samples were then analyzed via HPLC as described above. The mean and the 
standard error are reported in table 3. 

Circular dichroism spectroscopy (CD). Far UV-CD spectra were obtained using a 
Jasco J810 spectropolarimeter (located at the USC NanoBiophysics Core Facility) 
equipped with a Peltier device. The peptides (25-100 |iM) were prepared in 10 mM 
phosphate buffer at pH 7.4 and placed in a 1 mm path length cuvette. Thereafter, CD 
spectra were recorded in the range of 195-240 nm. Five spectra were acquired, and 
averaged. Spectra were baseline corrected by subtracting blank spectra of the 
corresponding solutions without peptide and ellipticities were converted to mean 
residue molar ellipticities in degrees cm^ dmor\ 

Results and Discussion 

Previously, we created a trillion-member mRNA display cyclic pep- 
tide library with the form MXXXXXXXXXK (termed MXiqK) and 
used seven rounds of selection to isolate cyclic peptides that bind to 
the signaling protein Gail-GDP^\ The best peptide from that selec- 
tion was cyclic GiBP (cycGiBP), a specific, 12-residue cyclic peptide 
with a Kd = 2.1 nM. Additionally, we found that cycGiBP was ~3- 
fold more resistant to chymotrypsin as compared to a linear version 
of the peptide ^\ 



In an effort to see if protease-resistant, natural-sequence peptides 
could be found, we took the Pool 7 library from our original experi- 
ment (Figure la) and subjected it to a two-step selection protocol 
(Figure lb). After cyclization with DSG, the library of mRNA peptide 
fusions was first subjected to degradation by immobilized chymo- 
trypsin for 15 minutes at room temperature and then selected for 
binding to the target of interest, here Gail -GDP. We chose the 15 
minute digestion time because preliminary experiments showed that 
more than 90% of the library was degraded under these conditions 
(data not shown) and our previous results indicated that cycGiBP 
should have been similarly degraded^ \ We reasoned this incubation 
should thus provide sufficient selection pressure to reveal protease- 
resistant sequences. After three rounds of selection, representative 
clones from Pool 10 were sequenced (Figure Ic). 

We then chose three sequences from pool 10 for further character- 
ization. We term these molecules cycPRPs for Cyclic Protease 
Resistant Peptides (cycPRP-1, cycPRP-2, and cycPRP-3). The 
cycPRP sequence consensus in Pool 10 is relatively similar to the 
consensus sequence seen in Pool 7 and also in other Gail -GDP 
binding peptides. The core sequence of the cycPRPs (TWIDFI, 
TWFEYL, TWFEFL) is very similar to cycGiBP (TWYEFV)^", R6A 
(YWWEYL)^^ KB-752 (TWYDFL)^^ the GSP peptide (TVWEFL)^^ 
and the AR6-05 peptides (YWWEFL)^^ selected by both our lab and 
others. The fact that the sequence consensus is similar suggests that 
the cycPRP peptides bind to Gail -GDP and proteolysis did not 
destroy binding. Additionally, all previously selected peptides bind 
near switch 2 of Gail -GDP, and based on this observation, we hypo- 
thesize that the cycPRPs bind to the same site on Gail -GDP. 

To verify that the cycPRPs bind to Gail -GDP as well as determine 
binding constants, the cycPRPs were initially assessed by in vitro 
pull-down with immobiUzed Gail -GDP. These data indicated that 
all three molecules were functional, with cycPRP- 3 being the best 
binder followed by cycPRP- 1 and cycPRP-2 (data not shown). 
Consistent with this view, we determined the values of cycPRP - 
3 to be 9 ± 2 nM and cycPRP- 1 to be 90 ± 20 nM for Gail -GDP. 
This binding is excellent compared to KB-752 (K^ = 3.9 uM), and on 
par with to cycGiBP (K^ = 2.1 nM), raising the possibility that the 
dual selection optimized the product of protease resistance and bind- 
ing affinity, rather than one or the other. 

The fact that the sequences retained a large fraction of aromatic 
residues in the core motif was somewhat surprising given the strong 
preference for chymotrypsin to cleave the backbone adjacent to tryp- 
tophan, tyrosine, or phenylalanine^^'^^. Indeed, our initial prediction 
was that the chymotrypsin selection would result in sequences lack- 
ing aromatic residues that still retained binding function, which 
clearly did not occur. Based on the primary sequence, cycPRP- 1, 
cycPRP-2 and cycPRP-3, each have 2-3 chymotrypsin cleavage sites 
(Figure 2), essentially the same as peptides isolated originally without 
protease challenge^ \ Initial HPLC/MALDI analysis of chymotryp- 
sin-cleaved peptides revealed that only one cut site was predominant 
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Figure 1 | mRNA Display selection for chymotrypsin resistance, (a) Pool 7 of the cyclic peptide library (MXiqK) targeting Gail -GDP was used as the 
starting point for the selection, (b) In rounds 8-10, the library was cyclized and subjected to chymotrypsin degradation and binding selection, (c) 
Representative clones from Pool 10 were sequenced (see supplemental information) and peptides cycPRP- 1, cycPRP-2, and cycPRP-3 were further 
characterized. 
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Table 1 | Chymotrypsin resistance of linear and cyclic 

Peptide Linear Half-Life (Minutes) Cyclic Half-Life (Minutes) 



GiBP 
cycPRP-1 
cycPRP-2 
cycPRP-3 



0.3 ± 0.1 
60 ±4 
63 ± 6 
36 ± 1 



2.6 ± 0.1 
90 ± 10 
90 ±40 
26 ± 2 



in each of the cycPRP sequences, similar to the result observed ori- 
ginally for cycGiBP. Interestingly, the major cut site varies, depend- 
ing on the sequence-occurring between Y-E in cycGiBP, W-I in 
cycPRP-1, and W-F in cycPRP-2 and cycPRP-3 (Figure 2). 

We next determined the half-life of the cycPRPs as substrates for 
chymotrypsin, under conditions similar to the chymotrypsin diges- 
tion during the selection (Table 1). The linear GiBP peptide provides 
a reference point to gauge improvements in protease resistance in 
this series-and we can use this fixed point to address the effects of 
protease challenge, cyclization, as well as comparing variations in the 
primary sequence. Regarding the overall effect of the selection, we 
observed a 100-200 fold improvement in linear PRPs resistance and 
a 100-300 fold increase of cycPRPs resistance to chymotrypsin, as 
compared with linear GiBP. The improved half-life of the PRPs (ti/2 
= 26 to 90 min) is also consistent with the peptides being able to 
withstand the 15 minute chymotrypsin digestion largely intact. 
Regarding cyclization, it is clear that cyclization improves chymo- 
trypsin resistance for cycGiBP, but for the protease resistant peptides, 
these gains are either quite modest (2-fold for cycPRP-1) or non- 
existent (cycPRP-3). The data thus indicate that cyclization is not the 
primary driver of chymotrypsin resistance in peptides selected using 
this protease challenge. 

There are no obvious variations in the primary sequence that 
would indicate the cycPRPs have improved chymotrypsin resistance 
as compared with cycGiBP. Put another way, there is no sequence 
model of which we are aware that predicts the cycPRPs should be 
more protease resistant than cycGiBP. Amongst the cycPRPs, the 
number of aromatic amino acids is almost identical to cycGiBP 
and there are no systematic alterations in the number of beta- 
branched amino acids, prolines, glycines, or other specific residues 
near or adjacent to the cut sites. Indeed, at least one statistical model 
predicts the W-F dipeptide seen at the Pi-P/ position in cycPRP-2 
and cycPRP-3 would be particularly susceptible to cleavage^^. 
Previous kinetic work with model substrates indicates that W at Pi 
is a markedly better substrate than Y or F (rank order Pj; W > Y > 
F)^^. In this view, cycGiBP could be seen as anomalous, since the 
primary cleavage site occurs between Y and E, even though there is a 
W-Y dipeptide step present. 

Our work here differs from model kinetic studies in that each of 
our substrates contains multiple possible cleavage sites. On the other 
hand, exhaustive analysis and alignment of many model substrates 
might be expected to reveal the importance of residues at other 
positions in the chain to the overall cleavage rate. Schellenberger et 
aP^., constructed exactly such a QSAR model based on available k^at/ 
Kin data and found that kcat/Km depends on the residue identity from 
P3 to P2' . Including residues outside this window did not improve the 
predictive power of the model, and the primary determinant at the 
P2' position was whether the residue was proline or not. While this 



cycGiBP MITWYEFVAGTK 
I I 
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I 

II I 
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t 
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Theoretical Cut Sites 
Observed Cut Sites 

Figure 2 | Theoretical (red) and observed (blue) chymotrypsin digest 
sites of peptides that bind Gail -GDP. cycGIBP is from Pool 7 of the 
original selection for function only, while the three cycPRP peptides 
(cycPRPl-3) are from Pool 10 of the protease/binding selection. Red 
arrows denote theoretical digest sites and blue arrows show the actual 
digest sited as determined by MALDI-TOF. 

model has several caveats (it treats the effect each position as inde- 
pendent and additive), the overall observations are relevant here. 
Most notable is the predicted variation in cleavage based on residue 
identity at each position. For P3 and P2, the model predicts variation 
in k^at/Km of 40-fold (A > R > G, D > P, K) and 10-fold (L,V > P > 
A > G) respectively. For Pi, the model predicts a relatively small 
variation in Y, W, and F (2-fold) and for P/ predicts a 20-fold 
variation (F, A > L, V, G). 

In order to quantitatively compare our peptides with previous 
model of chymotrypsin substrates, we determined K^t^K^ values 
for cycGiBP, cycPRP- 1, cycPRP-2, cycPRP-3 (Table 2). The KJ 
Km value for cycGiBP is 1.1 X 10^ sec~^ M"^ which is in line with 
model amide bonded substrates supporting the view that 
cycGiBP acts as normal substrate for the enzyme. Interestingly, the 
increased protease resistance of the PRP peptides originates from 
changes in both k^at and K^-with substrates showing 8-16 fold 
decreases in binding and 8-50 fold decreases in catalytic cleavage 
rates. cycPRP-2 and cycPRP- 1 have the smallest kcat/Km values, 
reduced 420-fold and 350-fold as compared to cycGiBP. This reduc- 
tion is remarkable because it substantially exceeds what would have 
been expected if all the aromatic residues had been converted to 
leucine^^. cycPRP-3 shows a 1 10-fold reduction in kcat/Km compared 
to cycGiBP, but is also the highest affinity binder. This data also 
argues that the selection optimized the product of the binding affinity 
and the reduction in kcat/K^, such that survival represents a com- 
promise between these selected traits. 

Previous observations have demonstrated that, at least with pro- 
teins, protease resistance can result from differences in protein struc- 



Table 2 | 
Peptide 


/Cm and Vmax enhancements for GiBP 
(^iM) 


and PRPs 

Vmax(liMmin-i) 


kcat/K. (M-i sec-^) 


Fold Improvement 


GiBP 


1.4 ±0.3 


37 ± 1 


1.1 X 10^ 


1 


cycPRP- 1 


1 1 ± 3 


0.81 ± 0.06 


3.1 X 102 


350 


cycPRP-2 


22 ±5 


1.4 ± 0.1 


2.6 X 102 


420 


cycPRP-3 


18 ±4 


4.2 ± 0.3 


9.9 X 102 


110 
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19,20 ^j^jig peptides are generally unstructured in solution, a 
possible mechanism that could reduce the cleavage rate of the 
PRPs would be if the peptides had significant levels of secondary 
structure. In this model, only the unfolded form of the peptide would 
be available for binding and cleavage by the enzyme. This model 
would not account for changes in k^at^ but could contribute to 
changes in by decreasing the fraction of free peptide available. 
To test this, we took the CD spectrum of both the linear and cyclic 
versions of GiBP, cycPRP-1, cycPRP-2, and cycPRP-3 in phosphate 
buffer at 10°C (Figure 3) and at 60°C (see supplemental informa- 
tion). The reduced temperature was used in order to maximize struc- 
ture formation, as peptide folding transitions are often quite broad 
due to their modest enthalpies of formation^^'^^. The largest structure 
difference between linear and cyclic molecules is seen for cycGiBP 
(Figure 3a) and very small differences are seen for cycPRP-1, 
cycPRP-2, and cycPRP-3 (Figure 3b-d). These spectra do not con- 
form simply to established model helix, sheet, or coil spectra^^. The 
lack of a minima near X = 222 nm indicates the peptides are not 
helical and the overall spectra look most similar to the anti-parallel 
beta sheet structure^l However, the spectra also are similar to 
unstructured peptides and show little changes upon heating the 
samples to 60°C (see supplemental information). The CD spectra 
and lack of temperature dependence are most consistent with both 
the linear and cyclic peptides lacking regular secondary structure, 
although they do not eliminate non- canonical folded forms. 

The lack of peptide structure and the relatively small importance 
of cyclization imply that the majority of the protease resistance 
results from sequence effects that alter the efficiency of these peptides 
as chymotrypsin substrates. This could be because the peptides are 
optimized to be poor chymotrypsin substrates or because of more 



Table 3 | Stability of I inear and cyclic peptides in human serum 
Peptide Linear Half-Life (Hours) Cyclic Half-Life (Hours) 



GiBP 
PRP-1 
PRP-3 



<0.02 
0.41 ± 0.02 
0.21 ± 0.02 



0.33 ± 0.02 
7.7 ± 0.4 
28 ±5 



general changes that make the peptides poor protease substrates. To 
test this, we incubated cycGiBP, cycPRP-1 and cycPRP-3 in human 
serum containing active proteases and peptidases and examined the 
half-life of the molecules (Table 3). 

Serum generally degrades peptides very quickly^^ due to the pres- 
ence of multiple proteases with differing specificity (e.g., thrombin, 
plasmin, and kallikrein). In line with that view, linear GiBP is unde- 
tectable at the first time point after incubation (ti/2 < 1 min) and 
cycGiBP shows a half-life of —20 minutes. Surprisingly, cyclization 
of the PRPs gives similar dramatic improvements in stability against 
serum digestion with increases of 20- to 130-fold for cycPRP-1 and 
cycPRP-3 respectively. This difference is consistent with serum ami- 
nopeptidase, which can digest linear peptides, but whose activity is 
blocked when the peptides are cyclized^^. The fact that the cyclic 
PRPs show dramatically improved stability against digestion by both 
chymotrypsin and the proteases present in serum, even though the 
peptides were never exposed to serum proteases during the selection, 
argues that the peptides have been optimized in a general way to 
resist proteolytic cleavage. This optimization is somewhat idiosyn- 
cratic, as the most stable serum peptide (cycPRP-3) is the least stable 
of the series against chymotrypsin alone. 
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Figure 3 | CD spectra of linear and cyclic versions of GiBP, PRP-1, PRP-2, and PRP-3. All spectra were taken at 10°C in 10 mM phosphate buffer pH 7.4 
with linear peptides are shown in blue and cyclic peptides in red. (a) GIBP. (b) PRP-1. (c) PRP-2. (d) PRP-3. 
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Conclusions 

Our two-step selection protocol (mRNA display using protease chal- 
lenge and binding selection) demonstrates that it is possible to isolate 
highly functional natural peptides that have the ability to bind a 
target of interest and that are resistant to proteases. This procedure 
results in stability increases ranging from 100- to 400 -fold. 
Mechanistically, the protease resistance results from both a lower 
rate of cleavage (k^at) and a weaker interaction with the enzyme 
(Km). Surprisingly, the protease resistant peptides do not contain 
dramatic sequence changes compared with non-protease resistant 
molecules. Only a few changes are needed to increase proteolytic 
stability, with 4-6 relatively conservative amino acid changes (out 
of 10 positions) needed to convert a peptide susceptible to protease 
into one resistant to protease. These changes are not predictable with 
existing heuristics/models, do not remove key residues that are sub- 
strates for the protease (here W, Y, and F), and do not appear to 
increase the ordered structure of the molecules. Rather, the changes 
appear to affect endopeptidase cleavage generally, perhaps through 
local conformational biases. Nonetheless, these results are exciting 
because they provide a route to create protease-resistant homologs of 
biologically- active peptides for use as reagents, diagnostics and ther- 
apeutics. 
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